Lexical and Acoustic Adaptation for Multiple Non-Native English Accents
نویسندگان
چکیده
This work investigates the impact of non-native English accents on the performance of an large vocabulary continuous speech recognition (LVCSR) system. Based on the GlobalPhone corpus [1], a speech corpus was collected consisting of English sentences read by native speakers of Bulgarian, Chinese, German and Indian languages. To accommodate for non-native pronunciations, two directions are followed: Modification of the dictionary to better reflect the non-native pronunciations and adaptation of the acoustic models for native US English with non-native speech. The proposed methods for dictionary modification are data-driven. Therefore no language-specific rules are necessary: The idea is to extract a parallel corpus of phoneme sequences from phonetic transcriptions of native US English and accented English in the George Mason University (GMU) accented database [2]. With this corpus, Statistical Machine Translation models are generated to translate the US English pronunciations in the GlobalPhone dictionary into accented pronunciations which are then used as new pronunciation variants in the GlobalPhone dictionary. With the combination of the lexical and acoustic model approaches, relative improvements of 26.9% for Bulgarian, 33.2% for Chinese, 30.9% for German, and 53.2% for Indian accents are achieved.
منابع مشابه
MLLR-based accent model adaptation without accented data
When the user has an accent different from what the automatic speech recognization system is trained with, the performance of the systems degrades. This is attributed to both acoustic and phonological differences between accents. The phonological differences between two accents are due to different phoneme inventories in two languages. Even for the same phoneme, foreigners and native speakers p...
متن کاملInfluences on perceived foreign accentedness: Acoustic distances and lexical neighborhoods
We often encounter speakers with non-native accents who produce speech different from typical native productions [1]. Native listeners are sensitive to these deviations and the degree of deviation along acoustic variables has been shown to affect how native listeners assign accentedness ratings to nonnative speech [7]. Additionally, lexical variables may influence the perception of foreign acce...
متن کاملComparing Lexical Bundles in Hard Science Lectures; A Case of Native and Non-Native University Lecturers
Researchers stated that learning and applying certain set of lexical bundles of native lecturers by non-native lecturers would help students improve their proficiency through incidental vocabulary input. The present study shed light on the lexical bundles in hard science lectures used by Native and Non-native lecturers in international universities with the main purpose of analyzing the structu...
متن کاملNon-native Pronunciation Modeling in a Command & Control Recognition Task: A Comparison between Acoustic and Lexical Modeling
In order to improve automatic recognition of English commands spoken by non-native speakers, we have modeled non-native pronunciation variation of Dutch, French and Italian. The results of lexical and acoustical modeling appeared to be source language and speaker dependent. Lexical modeling only resulted in a substantial improvement (of 35%) for the French speakers. Acoustic model adaptation ha...
متن کاملLexical Adaptation to a Novel Accent in German: A Comparison Between German, Swedish, and Finnish Listeners
Listeners usually adjust rapidly to unfamiliar regional and foreign accents in their native (L1) language. Non-native (L2) listeners, however, usually struggle when confronted with unfamiliar accents in their non-native language. The present study asks how native language background of L2 speakers influences lexical adjustments in a novel accent of German, in which several vowels were systemati...
متن کامل